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Abstract 

It is already known that in multicast (single source, multiple sinks) 
network, random linear network coding can achieve the maximum flow 
upper bound. In this paper, we investigate how random linear network 
coding behaves in general multi-source multi-sink case, where each sink 
has different demands, and we characterize all achievable rate of random 
linear network coding by a simple maximum flow condition. 

1 Introduction 

In an information transmission network, allowing coding operation at interme- 
diate nodes will increase the capacity of the network than simply relaying the 
packets. In multicast (single-source, multiple sinks) scenario, Ahlswede, Cai, 
Li, and Yeung proved that the maximum flow upper bound can be achieved by 
network coding in their seminal paper [5]. In 2003, Li, Yeung, and Cai proved 
that [3] linear network coding, i.e., only linear encoding and decoding is allowed, 
is sufficient to achieve the maximum flow upper bound. Later in 2003, Koetter 
and Medard formulated and dervied Li et. al.'s results using algebraic methods 
[5]. In 2006, Ho et. al. showed that, in fact, performing random linear network 
coding will achieve the upper bound, when the underlying coding field is large 
enough. Due to its simplicity, random linear network coding turns out to be a 
practial solution. 

In contrast to the multicast scenario, the general case (multi-source and 
multi-sink with arbitrary demands) is not well understood. In [5^, Koetter and 
Medard reduces the existence of linear network coding solution to the exsitence 
of a point in certain algebraic variety, which, in general, is NP-complete. In 
2005, Dougherty, Freiling, and Zeger showed that the linear network coding is 
not sufhcient in the general case [6]. In [9^, Yan, Yeung, and Zhang characterized 
the capacity region for multi-source multi-sink network coding. However, the 
region is difficult to compute. In fact, even approximating the capacity or linear 
capacity of network coding within any constant was proven to be hard . 

In this paper, we investigate how random linear network coding behaves 
in multi-source multi-sink network. It turns out that it will work in certain 
occasions, which can be easily characterized by a simple maxflow condition. 
And there is also a dichotomy of random network coding in the general case: it 
will work with probability — !■ 1 or fails with probability — ^ 1 when the size of 
the encoding field tends to infinity. 
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2 Notations 



Let's consider a multi-source multi-sink acyclic network, which consists of a 
directed acyclic graph G = {V,E), sources S = {si, S2, • ■ ■ , Sm} C V, sinks 
T = {t\, t2, ■ ■ ■ , tn}, for each t £T, demand dem(i) C [m], which indicates that 
t need to receive all symbols from source {si : i € dem{t)}. Rate (ri, . . . , r,n) G 
N"* means, for each Si sends symbols at a time over some underlying finite 
field F. 

When rate (ri, . . . , r„j) is fixed, for convenience of description, let's add m 
extra vertices s|, . . . , s*j, and also add edges from s* to Si. And denote by 
S* = {si, s'^}, r ^ri + ... + r„i. 

If Ui is a subspace F*", and C/2 is a complement of [/i, then every vector 
a e F'' can written uniquely as a sum of a vector in C/i and U2, which is 
denoted by 

a = a\ui + a\u2, 

where a\u-^ G Ui and alu^ £ f^2- In abuse of notation, given a basis of F'' 
including u, we denote by a\u the coefficient of u when expressing a in this 
basis. 

A linear network coding ip : E t-^ ¥^ is recursively defined as follows 

I fei , e is the j*'' edge from s* to Si , 

*'='nE..U,.„....,.,c.«e.), 0.1™, 

where {bij, l<i<m, l<j< Si}, is the standard orthogonal basis of F*", 
and Ci € ¥ are coefficients. When performing a random linear network coding, 
coefficients S F are chosen uniformly at random. If we take dem(i) = [m] for 
every source i, we obtain the multicast network coding theorem [2], [3], [S]- 

Let maxflow(s,t) denote the maximum number of edge-disjoint paths from 
s to t, and maxflow(5', t) denotes the maximum number of edge-disjoint paths 
from some s € S to t. By the maxfiow-mincut theorem, we know in both cases, 
the value equals the minimum size of the s-t (S-t) cut. 

3 Main result 

Lemma 1. Assume ijj is a linear network coding. Let Si C S, and Ui be the 
subspace spanned by {bij '■ Si (z Si,l < j < ri}. If (V5, Vr) is a Si-t cut, then 

ip{e)\ui S span{'(/'(eO|(7i : € (Vs,Vt)} 

for every edge e with head(e) = t. 

Proof. Let e by any edge with head(e) = t. By the definition of linear network 
coding, we have 

'(/'(e) e span{-i/;(ei) : head(ei) = tail(e)} 

e span{?/;(e2) : (3ei)(head(ei) = tail(e) and head(e2) = tail(ei))} 

e span{V'(ei) : head(ei) G S\Si, or (3e; e E){ei e and e; e {Vs, Vr))}- 
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Note that ip{ei)\ij-^ — for all tail(e;) E S \ Si, we claim 

ip{e)\ui e spa,n{i: {ei)\u, ■ ei G {Vs,Vt)}, 

which completes our proof. □ 

The next lemma explains why random linear network coding can achieve 
maximum flow bound in multicast (15*1 = 1) network. 

Lemma 2. Assume pi = (ei_i, . . . ei^;^), . . . ,pt = (ef_i, . . . , et^;J are t edge- 
disjoint paths. In random linear network coding, if ip{ei^i),^p{e2,i), ■ ■ ■ jipi^m,!) 
are linearly independent, then ip{ei,i^), ^^(62, is), ■ ■ ■ , 'ipi^t.it) "'^e linearly indepen- 
dent with probability — > 1 when |F| — > +00. 

Proof. Since G is acyclic, there is a topological order to add the edges one by 
one such that when is added, all edges ej with head(ej) = tail(ei) are already 
added. Let's add edges in this order to perform random linear network coding. 
Assume edges ei^i^ £ pi, . . . , et^t G Pt are added, and assume w.l.o.g the next 
edge to add is eis^+i, it suffices to show 

Pr{?/;(ei.i^_i_i), -0(62,12), • ■ • ,"0(6*, it) are linearly independent 
I ip{ei^i-^), . . . ,ip{et,it) are linearly independent } > 1 — p:]-, 

which will imply V(ei,;i), '0(62, /s), "0(6*, /t) are linearly independent with 
probability > (1 - )'i+---+'" . 

By the condition that 0(61,^5^), 0(et,it) are linearly independent, let's 
extend them to a basis of F*", that ui — -0(61, i^), . . . ,ut = ip{et^it), ""t+i, . • . , Wr G 
F^ Then 

Pr{V'(ei,ii+i), 0(e2,i2), . . . ,->p{et.it) are linearly independent | • • • }, 
= Pr{0'(ei,ji+i) ^ span(0'(e2,j2), ■ • ■ , V'(et,»J) | ■ ■ ■ } 
> Pr{l+ J2 Q-V;(e,)|ui ^0 I •••} 

e:hcad(ei )— tail(ei, + 1 ) 

_ j 1, if 0(ei)|„i = for every ei 

|l-l/|F|, otherwise. 

□ 

Combining Lemma [1] and Lemma [2l we conclude that a random linear net- 
work coding can achieve the maximum flow upper bound in multicast network. 

Also, from the above lemma by keeping track of |r| collections of paths and 
taking a union bound, we claim that as long as |F| > |r|, the probability that 
every sink can successfully decode is nonzero, which implies that there exists 
a linear network coding solution achieving the maxflow upper bound, which is 
first proved in In |7j, Feder, Ron and Tavory proved a lower bound of size 
^/2N{1 — 0(1)) by information theory arguments. 

The next theorem is our main result, which characterize all achievable rate 
for multi-source multi-sink network that random linear network coding will 
work. And it reveals a dichotomy in random network coding: for a given rate, 
the random linear network coding either succeed with probability — 1, or with 
probability — when the size of the coding field goes to infinity. 
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Theorem 3. Rate (ri, . . . ,r,„) G is achievable with probability -> 1 when 
|F| — > +00 by random linear network coding if and only if, for every t ^ T , 

maxfiow(5*\ |J {s*},i)+ ^ = maxfiow(5*, (1) 

iGdcm(i) iGdcm(i) 

Morevoer, if the above condition is not satisfied, a random linear network 
coding will succeed with probability — >■ when |F| — ^ +00. 

Proof. For the "if" part, assume ([l} is satisfied, we need to show a random 
hnear network coding can achieve rate (ri, . . . , r™) with probabihty — >■ 1 when 
|F| goes to infinity. 

Fix any sink t e T, it's enough to prove with probability — )■ 1 (|F| — !■ +00), 
sink t can decode all symbols from sources dem(i), i.e., 

{hi j : i e dcm(i), 1 < j < ri} C span{?/'(e) : head(e) = t} 

For convenience, let di = Etedcm(t) '^i' and da = maxflow(S'*\Uigdom(t)'t'*» 
And let i7i be the subspace of F'' spanned by bij with i e dem(i), j S [r^], and 
let U2 be the subspace spanned by bi j with i ^ dem(t), j £ [ri], which is the 
complement of Ui . 

Since maxflow(S'* \ UiGdem(t){'^i }' ^) ~ "^s, by maximum flow minimum cut 
theorem, there exists an {S* \ Uiedcm(t){'5i })"^ '^^^ {Vs,Vt) with size da, where 
we denote by ei, . . . , e^^ all the edges in cut (Vs, Vr)- By LemmadJ we know 
that, for every e with head(e) — t, 

V'(e)|(72 e span{i/) (6^)1(72 : i = 1, . . . , ^2}. 

Note that, ■ip[e)\u^ G Ui. Thus, 

V'(e) e ?7i + span{V'(ei)|;72 : i = 1, . . • ,^2}, 

where dim(?7i + span{'0(ei)|[/2 : i = 1, . . . ,^2}) = di + ^2- 

On the other hand, by Lemma [2] and the condition that maxflow(S'*, i) — 
di + 1^2 , we claim 

rank{'0(e) : head(e) — t} > di + d2 

holds with probability 1 (|F| — > +cxd), which implies Ui + span{'0(ei)|c/2 : i = 
1, . . . , ^2} — span{'0(e) : head(e) = t}, i.e., the random network works. 

For the "only" if direction, let's assume for contradiction that di + d2 ^ 
maxflow(S'*, i) and sink t can decode all symbols bij with i e dem(i), 1 < j < ri. 
Noting that di + d2 > maxflow(S'*, i) always holds by the definition of fiow, we 
may assume di + ^2 > niaxflow(S'*, i). 

By Lemma [T] and Lemma [21 we have 

rank{'(/;(e)|[/j : head(e) = t} = di, (2) 

and 

rank{?/'(e)|[/2 : head(e) = t} = d2 (3) 

hold with probability — > 1 when |F| — > +00. Using Lemma [2] again by taking 
= 5", we have 

rank{V'(e)|(72 : head(e) = t} < maxflow(S'*, t) < di + d2. (4) 
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If t can decode all symbols from sources dem(i), i.e., 

{bij : i e dem(t), 1 < j < r^} C span{?/;(e) : head(e) — t}, 
then, by ([3]), we have 

{tp{ei)\u2 : 1 < * < d,2} C span{-0(e) : head(e) = t}, 
which implies ia.nk{ip{e) : head(e) ~ t} > di + d2, contradicted with Q. □ 

By the above theorem, it's easy to verify the achievable rate of random linear 
network coding is monotone, i.e., the achievability of (ri,...,rm) implies the 
achievability of {r[, . . . , r^) as long as < for alH = 1, 2, . . . , m. 

For further research, we feel the following question (not well formulated) is 
interesting. 

Open Question. For general multi-source multi-sink network coding, is 
there any computationally feasible solution which achieves better performance 
than random network coding? 
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